Comparing Approaches for Automatic Question Identification
نویسندگان
چکیده
Collecting spontaneous speech corpora that are open-ended, yet topically constrained, is increasingly popular for research in spoken dialogue systems and speaker state, inter alia. Typically, these corpora are labeled by human annotators, either in the lab or through crowdsourcing; however, this is cumbersome and time-consuming for large corpora. We present four different approaches to automatically tagging a corpus when general topics of the conversations are known. We develop these approaches on the Columbia X-Cultural Deception corpus and find accuracy that significantly exceeds the baseline. Finally, we conduct a cross-corpus evaluation by testing the best performing approach on the Columbia/SRI/Colorado corpus.
منابع مشابه
Automatic Identification and Classification of the Iranian Traditional Music Scales (Dastgāh) and Melody Models (Gusheh): Analytical and Comparative Review on Conducted Research
Background and Aim: Automatic identification and classification of the Iranian traditional music scales (Dastgāh) and melody models (Gusheh) has attracted the attention of the researchers for more than a decade. The current research aims to review conducted researches on this area and consider its different approached and obstacles. Method: The research approach is content analysis and data col...
متن کاملManaged Pressure Drilling Using Integrated Process Control
Control of wellbore pressure during drilling operations has always been important in the oil industry as this can prevent the possibility of well blowout. The present research employs a combination of automatic process control and statistical process control for the first time for the identification, monitoring, and control of both random and special causes in drilling operations. To this end, ...
متن کاملKohonen Self Organizing for Automatic Identification of Cartographic Objects
Automatic identification and localization of cartographic objects in aerial and satellite images have gained increasing attention in recent years in digital photogrammetry and remote sensing. Although the automatic extraction of man made objects in essence is still an unresolved issue, the man made objects can be extracted from aerial photos and satellite images. Recently, the high-resolution s...
متن کاملApproaches to Named Entity Recognition in Indian Languages: A Study
Named Entity Recognition (NER) is subtask of information extraction that seeks to locate and classify the elements in some text into pre-defined categories. NER finds its application in Natural Language Processing tasks like machine translation, question-answering systems and automatic summarization. The approaches to NER are rule based, statistics based or a combination of both. In this paper,...
متن کاملIdentification of versions of the same musical composition: audio content-based approaches and post-processing steps
This book focuses on the automatic identification of musical piece versions (alternate renditions of the same musical composition like cover songs, live recordings, remixes, etc.). In particular, two core approaches for version identification are proposed: model-free and model-based. Furthermore, the book introduces the use of post-processing strategies to improve the identification of versions...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017